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Abstract 

We present a method for the decomposition of mass spectra of mixture gases using Bayesian 
probability theory. The method works without any calibration measurement and therefore applies 
also to the analysis of spectra containing unstable species. For the example of mixtures of three 
different hydrocarbon gases the algorithm provides concentrations and cracking coefficients of each 
mixture component as well as their confidence intervals. The amount of information needed to 



obtain reliable results and its relation to the accuracy of our analysis are discussed. 
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I. INTRODUCTION 



Mass spectrometry is a standard technique for residual gas analysis and active processing 
control in vacuum devices. Traditional low resolution quadrupole mass spectrometers are 
widely used due to high sensitivity, reasonable stability wide operational pressure range, 
high scan speed and low costs. To be filtered in the quadrupole field, neutral gases have 
to be ionized first, most commonly by electron impact. At a typical electron energy of 
50-100 eV used to achieve a high ionization efficiency and stability analyte molecules can 
decompose into a variety of fragment ions leading to the so called cracking pattern(CP). 
The fragmentation is a molecular specific property which reflects the chemical structure and 
the atomic composition of the molecule. It can therefore be used for the decomposition of 
multicomponent mass spectra. 

A direct decomposition of mass spectra from mixture gases by successive subtraction 
of component contributions from the mixture signal, the wide spread pedestrian approach, 
works only if there are non-interfering mass numbers. A more elaborate method, least square 
evaluation, is not subject to this limitation and can incorporate measurement errors into the 
analysis |1|, 0. In both cases one must assume the CP of the components to be exactly known 
0]. The CP is, however, not an intrinsic property of molecules but depends on the particular 
mass spectrometer and operating parameters making the determination of CP nontrivial. 
The problem becomes even more severe if unstable species like radicals are among the 
components of a mixture, since in this case there are neither calibration measurements nor 
literature values for the CP available. In case of mixtures containing a single radical the 
CP of the radical is usually identified with the rest signal after subtraction of all component 
signals from the mixture signal. Because of the error propagation of measured signals the 
above procedure, however, provides only poor and sometimes unphysical (e.g. negative 
cracking coefficients) estimations. 

To overcome these difficulties and to treat measured spectra consistently we introduced 
a novel method using Bayesian probability theory ||. This was applied to the pyrolysis of 
azo-methane in order to evaluate concentrations and improved cracking coefficients together 
with their confidence intervals for the pyrolysis products including the methyl radical. In the 
present work we deal with another case of decomposition, in which mixtures made up given 
different compositions have to be decomposed without using any calibration measurements. 
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Because of lack of calibration measurements this method offers a practical use, in particular, 
in handling unknown mixture gases. But most importantly, this approach is able to treat 
mixtures containing unstable species, because exact CPs are not needed. 

As an example we synthesize mixtures of three different hydrocarbon gases, ethane(C2H 6 ), 
propane(C3H 8 ) and n-butane(C4H 10 ). The gas composition and the cracking coefficients of 
the mixture components will be estimated by independent experiments and compared with 
the results from the Bayesian analysis. 



II. BAYESIAN DATA ANALYSIS 



The procedure used in this work is similar to that used for the decomposition of the 
pyrolysis products of azo-methane introduced by Schwarz-Selinger et al. ||, where details 
of the calculation may be found. Here we present only the relevant steps for our algorithm. 

Assuming linear response of the mass spectrometer the mass signal dj of a mixture is the 
sum of contributions of all species in the mixture 

dj = C5cj + Ej , (f ) 

where j = 1, J with J mixtures. The vector dj has N elements representing the same 
number of mass channels. The composition vector Xj has the dimension M of the number 
of species in the mixture. C is a cracking matrix with M column vectors corresponding to 
the cracking patterns of M species. Since we are interested in compositions and cracking 
patterns but not in the absolute intensities, d,, and the column vectors of C are normalized 
with respect to the sum of intensities of all mass channels for practical reasons, which enforces 
in turn the normalization of 

N N M M N M 

^ ^ dnj ^ ^ ^ ^ C nm X m j ^ ^ %mj ^ Cnm ^ •^mj 1 > (2) 
n=l n=l m=l m=l n=l m=l 

where x m j is the m-th component of the j-th mixture. This sum rule has been overlooked 
in our previous work 0. The results in Ref. 3 approximate however closely the sum rule. 
Quite the same it does improve stability of the estimates on x and C if the relation is 
explicitly included in the analysis. The vector Sj in Eq. (1) represents the finite noise of the 
experiments. We assume that Ej is bias free, (£j) = 0, and has a variance of (e^) = cr^-. 
Note that for the absolute composition of the mixtures 5L- in (1) has to be scaled by a 



sensitivity factor of the mass spectrometer for molecules considered, which can be done by 
independent calibration experiments. Since it is beyond the scope of this work it will not 
be discussed. In the following we call it, concentration of a mixture. 

Now we can make use of the normalization condition (2) and reduce the number of 
unknown concentrations by 1. We split Cx^ in Eq. (1) into two parts 

M M-l 

CXj = ^ C m X m j = ^ C m X m j + CM%Mj • (3) 
m=l m=l 

Using x Mj = 1 - J2m=i Xmj, Cxj becomes 

M-l 

CXj = c M + 2j - c M )x m j ■ (4) 
Then Eq. (1) can be rewritten 

dj - c M = y^(c m - c M )^mj + £j • (5) 



m=l 



M-l 



m=l 



We define 



dj = dj - c M 

C = (ci - c M , c 2 - c M) — , c M -i - c M ) (6) 



and obtain the simplified model equation 

d j = Cx j + e j . (7) 

Knowing the model equation for the mass signal, which assumes (dj — Cxj) = and 
((dj — Cxj)^) = a^j, the principle of maximum entropy (3j leads to a Gaussian sampling 
distribution(Zz&e^7ioo(f) for the data set of the j-th mixture 

pidjlxj, C, Sj, I) = 1 exp i-\ (dj - Cx,f St 2 (dj - Cx,)} , (8) 



where (S~ 2 ) nn = V s rl/ and (S~ 2 ) n[ = for n ^ I, with s nj - being the n-th component of 
the sample variance Sj used for a n j. I denotes all background information available. We 
assume independent measurements. This means that the outcome of an experiment is not 
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influenced by any other data set. In such a case the likelihood for all data sets reduces to a 
product of the likelihoods of the individual measurements 



P 



;d|x, c, s, i) = lipids, c, Sj, i) . (9) 



This can easily be shown by repeated application of the product rule of the Bayesian 
probability theory p(A, B) = p(A) p(B\A). For reasons of convenience we used the notation 
{dj} = D, {xj} = X and {Sj} = S, where D and X are matrixes with J column vectors 
dj and Xj, respectively. 

Our task is to find the concentration Xj (j = 1, J) and the cracking coefficients c m 
(m = 1,...,M). Let us start with x,,-. The expectation value of x^ is derived from the 
probability density p(X|D, S, /) as 

(x fc ) = J dXx k p(X\B,S,I) . (10) 

p(X|D, S, /) is obtained from Bayes theorem as [|J 

p(X|D,S,/) =p(X.\I) p(D|X,S, I)/Norm , (11) 

in terms of the marginal likelihood p(D|X, S, /). The latter is derived by the marginalization 
rule 

p(D|X,S,7) = /"dCp(D,C|X,S,/) = JdCp(C\I) p(D|C,X,S,7) . (12) 

Note that p(C|X, S,I) = p(C\I), since the composition and the sample variance do not 
contain any information about the cracking coefficients. We use a flat prior for p(K\I) in 
the range of < x m j < x max j, since there exists no prior knowledge about the composition, 
and obtain 

(x fc ) ~ J dC p(C\I) j rfX x fe p(D|X, C, S, I) , (13) 

where the proportionality accounts for the normalization factor in Eq. (11) and the flat 
prior p(X|J). For the analytical integration of the inner integral in (13) we assume that the 
integrand is sharply peaked in the prior range for X, so that the integral does not change if 
the integral limits are extended to (— oo, oo). The likelihood p(D|X, C, S, I) in Eq. (13) is 
a multivariate Gaussian function (8,9), whose exponent can be transformed into a complete 
square 

(d, - C Xi ) T St 2 (d, - Cxj) = (x, - x j0 ) T Qj (x, - x, ) + R, . (14) 
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The integral of the right hand side of Eq. (14) over x, with range (—00, 00) returns 

expU^M |detQj|-3 , (15) 
and thus for the rightmost integral of (13) we obtain 

J dXx k p(D\X,C,S,I)=x k0 fJexp(-^)|detQ,|-5 , (16) 

where x^q, Rj and Qj result from the comparison of the coefficients in Eq. (14) 

Qj = C'j S ; 2 Cj 

x fc0 = Q^CfS^cU (17) 
Rj = dJSj 2 dj - xJoQj-Xj-o . 

The calculation of the matrix equations can be considerably simplified by using singular 
value decomposition (SVD) of Cj. For details readers are referred to Ref. 3 and 5. 

To complete the integration (13) the prior probability p(C|J) for the cracking coefficients 
has to be assigned. As a prior knowledge / we use the table of Cornu and Massot |§ , which 
provides only point estimates for CP. Then, by virtue of the maximum entropy principle 
@], the prior probability distribution of C given the tabulated value Cq is an exponential 
function 

p(c\I)=p(c\c )=exp(-X5)/Z , (18) 

where Cq is the literature value for the matrix element c. Eq. (18) is a general form of the 
exponential function having the normalization constant 1/Z and the scale factor A. The 
latter can be obtained from the requirement that the expectation value of c on the support 
< c < 1 must be equal to Co 

f 1 ,~ - , 1 1 - (1 + A) exp(-A) 

c = dec exp -Ac ) Z = \ \\ } = c . 19 

Jo A 1 - exp(-A) 

For p(C|Co) the prior probabilities p(c\co) of all elements of the matrix C have to be 
multiplied 

p{C\I)=p(C\C ) = Y[ P {c nm \c , nm ) . (20) 

nm 
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Inserting these results into Eq. (13) we have 

(Xfc)~ /" x fe0 P(C|/) J]exp(-^)|detQ J |-^ . (21) 

The integration over C is performed by Markov Chain Monte Carlo(MCMC) method|7j 
with the sampling density p(C) 

p(C)~p(C|J)IJ ffl q)(-^|detQ J |-i . (22) 

3 

For the second moment of we find 

(x 2 mk) = J dC [(Q^) mm + x 2 mk0 ] p(C) , (23) 

which allows to calculate the variance of 

(Aa& fc ) = (x 2 mk ) - (x mk ) 2 . (24) 

Now we have to assign the expectation value and the variance for Xmh- The calculation for 
{xMk) is straightforward 

M-l M-l 
m=l m=l 

Accordingly, for (x 2 Mk ) we have 

M-l M-l 

m=l i=l 
M-l M-l 

= 1 - 2 ^ (x mfc ) + (x mk xi k ) . (26) 

m=l m,/ 

From Eq. (25) and (26) it follows 

( A ^Mfc) = (XMk) ~ (^Mk) 2 

M-l M-l 

= 1 ~ 2 } u (x m k) + (XmkXlk) (27) 

m=l m,i 

M-l M-l 



Xmk 

m=l m,l 
yj {(XmkXlk) — (x m k) (Xlk)} 



M-l 



m, I 



The estimation of cracking coefficients proceeds in a similar way. First, Bayes theorem was 
applied to rewrite the required posterior probability p(C|D, S, 7) in terms of a prior on C 
and the marginal likelihood p(D|C, S, 7) 

p(C|D,S,7)~p(C|7)p(D|C,S,7) , (28) 

and by using the marginalization rule and the product rule, the likelihood p(D|C, S,7) 
becomes 

p(D|C,S,7) = J dXp(D,X|C,S,7) (29) 
= JdXp(X\I) p(D|X,C,S,7) . (30) 

Finally, we arrive at 

«) = J dC ~& m p(C) , (31) 

where /x = 1, 2 for the first and second moment for c m . The sampling density p(C) is the 
same as in Eq. (22). As a consequence C and X can be determined by a same MCMC 
sampling. 



III. RESULTS AND DISCUSSION 

All mass spectra presented in this work were taken with a quadrupole mass spectrometer 
(Hiden, HAL 201), which has the standard axial ion source and the secondary electron 
multiplier (SEM) for ion detection. The mass spectrometer chamber with a base pressure 
better than 1 • 10 _9 hPa is differentially pumped and connected with the main chamber by 
a capillary (0 1 mm, 10 mm long). Both chambers are pumped with a separate turbo 
molecular pump. The gas flows into the main chamber are controlled by three individual 
MKS flow controllers (type 1259C), which allow a nominal error of 0.8 %. During calibration 
as well as mixture measurements the pressure in the mass spectrometer chamber was kept 
at about 5 • 10~ 8 hPa to minimize any pressure effect on the cracking patterns. 

Fig. 1 depicts the mass signals of 6 mixtures of different gas compositions composed of 
ethane, propane and n-butane for the most intensive 14 mass channels. Note the strong 
similarity of the six spectra. The quoted ratio was determined from the gas flows into the 
main chamber and does not reflect the actual concentrations, since the gas flow through 
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the capillary as well as the pumping power in the main and the mass spectrometer chamber 
strongly depend on species. However, they still may serve as a crude estimate of the mixture 
ratio. 

The upper part in Fig. 2a shows calibration measurements of C2H 6 , C3H8 and C4H40 
together with their literature values taken from Cornu and Massot 0. The cracking co- 
efficients are normalized with respect to the sum of the intensities, so that they all have 
the total intensity of one making the direct comparison of cracking patterns from different 
sources possible. Although the rough structures of CP from literature and calibration mea- 
surements for the three gases look very similar, nearly every peak shows a clear deviation 
between calibration and literature values. This is actually expected since the CP depends on 
the geometry of ion sources, analyzer transmission and mass dependent secondary electron 
multiplier gain reflecting the specification of a particular mass spectrometer. The lower plots 
in Fig. 2a show the results of the Bayesian analysis on the mixture data in Fig. 1. The 
agreement of CP between calibration and our analysis is very good. For ethane the Bayesian 
results coincide exactly with those from the calibration, and for propane and butane only 
few points show a small deviation, namely peaks at m=28 and m=29, 43, respectively. But 
this deviation means not necessarily an error in the measurements or analysis. In contrast, 
it is conceivable that CP of pure gases is changed in a mixture by an influence of gases 
on each other through a simultaneous presence in the ion source or by variation of partial 
pressures M. 

Fig. 2b shows the concentrations of the components in six mixtures from the Bayesian 
analysis and from the independent estimation by pressure measurement. For the latter the 
partial pressure of known gas flows of pure gases was measured by a baratron gauge, which 
is mounted in the main chamber, and the flux rate of a mixture was rescaled to partial 
pressures of each gas. The error of this estimation was assessed to 10 %, which is mainly 
contributed by the long-term fluctuation of the total pressure and the tolerance of the flow 
controller. Our results in Fig. 2b are in a satisfactory agreement with the estimation by 
pressure measurement and rather accurate with about 3 % confidence interval. It can also 
be found that there is a systematic deviation between the two sources: the concentrations 
from the pressure estimation for ethane are lower than those from our analysis and the 
concentrations for butane show exactly the opposite. This may be explained by the mass 
dependent conductance through the capillary ||, which connects the mass spectrometer 
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and the main chamber with the gas inlet system. Since the conductance of the capillary 
is larger for ethane than for butane, the estimations by the baratron have to be increased 
for ethane and decreased for butane, which leads to a better agreement between the two 
determinations. In fact, the determination of the composition of a mixture is rather difficult 
and needs much care. A gaseous molecule in a mixture can affect the molecular kinetic of 
other gases and thus the pressure reading of pure gases cannot be directly combined with the 
mixture composition. If no calibration of pure gases is available as in the case of radicals, 
the pressure determination is even impossible. Our method in this case provides a unique 
way to yield concentrations of all components in the mixture. 

The accuracy of Bayesian analysis relies on the amount of information provided, which 
include prior knowledge, number of measurements or the availability of calibration measure- 
ments. It is therefore interesting and important to ask how much information or how many 
data sets we need to obtain reliable results from the equation system (7). A single mixture 
delivers 13 known and 2 unknown parameters (14 — 1 for dj and 3 — 1 for Xj due to the 
normalization), while the number of unknown cracking coefficients (=30) is unchanged. For 
the latter mass numbers with no significant signal compared to the background noise are 
neglected. For the analysis with 3 mixture data, for example, the number of known and 
unknown parameters are 39 and 36, respectively, illustrating an overdetermined equation 
system. This consideration, however, is only valid if the input data are linearly indepen- 
dent. Our data by no means meet this condition, as can be seen qualitatively in Fig. 1. 
Quantitatively, the significant singular values of the SVD of the data matrix gives the num- 
ber of sources, which effectively contribute to solving the equation system. For the data 
matrix with six column vectors only three singular values are significant (Table 1) indicat- 
ing that at least six data sets would be needed for the reliable decomposition of the mixtures 
by the purely mathematical treatment. In contrast, for the Bayesian approach each data set 
contributes to increase the accuracy of the analysis on the basis of prior knowledge. As can 
be seen in Fig. 2, six mixtures were proved to contain enough information to decompose the 
mixtures. Let us now reduce the input data down to three mixtures. In Fig. 3 the results 
from three data sets are compared to those from six data sets. The point estimations of the 
cracking coefficients are nearly unchanged, but the error bars became somewhat broader, 
which seems to be reasonable because of less information. The concentrations derived from 3 
and 6 mixtures also show no significant change. These very satisfactory outcomes from three 
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mixtures would not be possible by directly resolving the equation system (7). It should be 
also noted that the analysis with two mixture data illustrating a superior number of unknown 
parameters leads to a strong uncertainty in the estimation of both cracking coefficients and 
concentrations. 

We can also even increase the accuracy by adding calibration measurements to our anal- 
ysis. Fig. 4 shows the results deduced from 7 data sets including the 6 mixture data and the 
calibration measurement for butane. One can see that the deviation of cracking coefficients 
at m=29, 43 for butane shown in Fig. 2a entirely vanishes and that the overall error signif- 
icantly decreases. This demonstrates that the incorporation of calibration measurements is 
an efficient way to improve the accuracy of our analysis. This may be possible even for very 
unknown mixtures, since in most cases one can infer from the source of mixture gases one 
or more of its components. 



IV. SUMMARY 



We introduced a method for decomposition of multicomponent mass spectra using 
Bayesian probability theory, which was used to derive concentrations and cracking coef- 
ficients of mixture components as well as their confidence interval without any use of cal- 
ibration measurements. For synthesized mixtures from ethane, propane and butane the 
algorithm was able to deliver precise results, which were compared with independent esti- 
mations. This example exhibits a very challenging case, in which the mass spectra of the 
mixtures show only a moderate difference from each other. Since the number of available 
mixture data does not restrict the application of our method but merely affects the accuracy 
of the analysis, even cases in which limited number of mixtures is available can be processed 
by our Bayesian approach. For unknown mixtures, if only few components are assumed to 
be known from the source of the mixtures, the outcome of our analysis for the concentrations 
serves as a critical criterion to decide whether a certain species is among the constituents of 
the mixture. 

Future applications include the decomposition of neutral particle fluxes from low temper- 
ature hydrocarbon plasmas including larger radicals employed in thin-film deposition and 
etching. The radicals are believed to play an crucial role in film formation at the plasma- 
surface boundary |L0, [UJ . Their quantification, which will be possible by the present method, 
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is therefore prerequisite for understanding of the microscopic film growth process. In this 



case mixture data are easily available, e.g. by variation of plasma parameters [10 
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Table 1 Eigenvalues of the singular value decomposition (SVD) of the data matrix with 
six column vectors shown in Fig. 1. 

Fig.l: Mass signals of six different gas mixtures composed of ethane(C 2 H 6 ), 
propane(C3H 8 ) and n-butane(C4Hi ). The mixture rates given for C2H 6 :C3H 8 :C4Hi 
were determined from gas flows through the gas-inlet device. 

Fig.2: (a) upper: cracking patterns from calibration measurements and literature taken 
from Ref. 6. The mass signals were normalized to the sum of the intensities, lower: 
Calibration measurements vs. Bayesian analysis using the mixture data in Fig. 1. (b) 
component concentration of the 6 mixtures from pressure measurement (grey) and from 
the Bayesian analysis (black). E, P and B in x-axis stand for ethane, propane and butane, 
respectively. See text for error determination. 

Fig.3: Cracking patterns (a) and concentrations (b) of mixture components from 
Bayesian analysis using 3 (black) and 6 (grey) mixtures. 

Fig.4: Cracking patterns from calibration measurements and Bayesian analysis using 6 
mixtures and the calibration measurement of butane. 



14 



0.9889 0.1126 0.0716 0.0014 0.0007 0.0006 



TABLE I: Eigenvalues of the singular value decomposition (SVD) of the data matrix with six 
column vectors shown in Fig. 1. 
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